158 research outputs found

    Data mining of the GAW14 simulated data using rough set theory and tree-based methods

    Rough set theory and decision trees are data mining methods used for dealing with vagueness and uncertainty. They have been utilized to unearth hidden patterns in complicated datasets collected for industrial processes. The Genetic Analysis Workshop 14 simulated data were generated using a system that implemented multiple correlations among four consequential layers of genetic data (disease-related loci, endophenotypes, phenotypes, and one disease trait). When information from one layer was blocked and uncertainty was created in the correlations among these layers, the correlation between the first and last layers (susceptibility genes and the disease trait in this case) was not easily detected directly. In this study, we proposed a two-stage process that applied rough set theory and decision trees to identify genes susceptible to the disease trait. During the first stage, based on phenotypes of subjects and their parents, decision trees were built to predict trait values. Phenotypes retained in the decision trees were then advanced to the second stage, where rough set theory was applied to discover the minimal subsets of genes associated with the disease trait. For comparison, decision trees were also constructed to map susceptible genes during the second stage. Our results showed that the decision trees of the first stage had accuracy rates of about 99% in predicting the disease trait. The decision trees and rough set theory failed to identify the true disease-related loci.

    A genome-wide scan using tree-based association analysis for candidate loci related to fasting plasma glucose levels

    BACKGROUND: In the analysis of complex traits such as fasting plasma glucose levels, researchers often adjust the trait for some important covariates before assessing gene susceptibility, and may at times encounter confounding among the covariates and the susceptible genes. Previously, the tree-based method has been employed to accommodate the heterogeneity in complex traits. In this study, we performed a genome-wide screen on fasting glucose levels in the offspring generation of the Framingham Heart Study provided by the Genetic Analysis Workshop 13. We defined one quantitative trait and converted it to a dichotomous trait based on a predetermined cut-off value, and performed association analyses using regression and classification trees for the two traits, respectively. A marker was interpreted as positive if at least one of its alleles exhibited association in both analyses. Our purpose was to identify candidate genes susceptible to fasting glucose levels in the presence of other covariates. The covariates entered in the analysis included sex, body mass index, and lipids (total plasma cholesterol, high density lipoprotein cholesterol, and triglycerides) of the subjects, and those of their parents. RESULTS: Four out of seven positive regions in chromosomes 1, 2, 6, 11, 16, 18, and 19 from our analyses harbored or were very close to previously reported diabetes-related genes or potential candidate genes. CONCLUSION: This screening method, which employed tree-based association, showed promise for identifying candidate loci in the presence of covariates in genome scans for complex traits.

    Construction of endophenotypes for complex diseases in the presence of heterogeneity

    Endophenotypes such as behavior disorders have been increasingly adopted in genetic studies for complex traits. For efficient gene mapping, it is essential that an endophenotype is associated with the disease of interest and is heritable or co-segregating within families. In this study, we proposed a strategy to construct endophenotypes to analyze the Genetic Analysis Workshop 14 simulated dataset. Initially, generalized estimating equation models were employed to identify phenotypes that were correlated with the disease (affected status) in combination with the family structures in the data. Endophenotypes were then constructed with consideration of heterogeneity as functions of the identified phenotypes. Genome scans on the constructed endophenotypes were carried out using family-based association analysis. For comparison, genome scans were also performed with the original affected status. The family-based association analysis using the endophenotypes correctly identified the same susceptible gene in about 80 of the 100 replicates.

    An integrated analysis tool for analyzing hybridization intensities and genotypes using new-generation population-optimized human arrays

    The cross-sample plot of the multipoint LOH/LCSH analyses of the three samples used in Fig. 5. The plot comprises four panels: (a) The top-left panel is a cross-sample and cross-chromosome plot. The vertical axis is the index of study samples, and the horizontal axis is the physical position (Mb) on each of the 23 chromosomes. The blue and red bars represent SNPs without and with LOH/LCSH, respectively. (b) The top-right panel is a histogram of cross-chromosome aberration frequency. The vertical axis is the index of study samples, and the horizontal axis is the cross-chromosome aberration frequency of the corresponding samples. The pink (skyblue) background indicates that the genetic gender of a sample is female (male). The histogram represents the aberration frequency of LOH/LCSH SNPs across the chromosomes of the corresponding samples. (c) The bottom-left panel is a histogram of the cross-sample aberration frequency. The vertical axis is the cross-sample aberration frequency of a SNP, and the horizontal axis is the physical position (Mb) on each of the 23 chromosomes. The purple line represents the aberration proportion of samples carrying the SNPs with LOH/LCSH. (d) The bottom-right panel is the legend of the genetic gender that is used in panel (b), where the pink (skyblue) background indicates that the genetic gender of a sample is female (male). (TIFF 1656 kb)

    Identification of Novel Susceptibility Loci for Kawasaki Disease in a Han Chinese Population by a Genome-Wide Association Study

    Kawasaki disease (KD) is an acute systemic vasculitis syndrome that primarily affects infants and young children. Its etiology is unknown; however, epidemiological findings suggest that genetic predisposition underlies disease susceptibility. Taiwan has the third-highest incidence of KD in the world, after Japan and Korea. To investigate novel mechanisms that might predispose individuals to KD, we conducted a genome-wide association study (GWAS) in 250 KD patients and 446 controls in a Han Chinese population residing in Taiwan, and further validated our findings in an independent Han Chinese cohort of 208 cases and 366 controls. The most strongly associated single-nucleotide polymorphisms (SNPs) detected in the joint analysis corresponded to three novel loci. Among these KD-associated SNPs, three were close to the COPB2 (coatomer protein complex beta-2 subunit) gene: rs1873668 (p = 9.52×10⁻⁵), rs4243399 (p = 9.93×10⁻⁵), and rs16849083 (p = 9.93×10⁻⁵). We also identified a SNP in the intronic region of the ERAP1 (endoplasmic reticulum aminopeptidase 1) gene (rs149481, p_best = 4.61×10⁻⁵). Six SNPs (rs17113284, rs8005468, rs10129255, rs2007467, rs10150241, and rs12590667) clustered in an area containing immunoglobulin heavy chain variable region genes, with p_best values between 2.08×10⁻⁵ and 8.93×10⁻⁶, were also identified. This is the first KD GWAS performed in a Han Chinese population. The novel KD candidates we identified have been implicated in T cell receptor signaling, regulation of proinflammatory cytokines, and antibody-mediated immune responses. These findings may lead to a better understanding of the underlying molecular pathogenesis of KD.

    Garlic Accelerates Red Blood Cell Turnover and Splenic Erythropoietic Gene Expression in Mice: Evidence for Erythropoietin-Independent Erythropoiesis

    Garlic (Allium sativum) has been valued in many cultures both for its health effects and as a culinary flavor enhancer. Garlic's chemical complexity is widely thought to be the source of its many health benefits, which include, but are not limited to, anti-platelet, procirculatory, anti-inflammatory, anti-apoptotic, neuro-protective, and anti-cancer effects. While a growing body of scientific evidence strongly upholds the herb's broad and potent capacity to influence health, the common mechanisms underlying these diverse effects remain disjointed and relatively poorly understood. We adopted a phenotype-driven approach to investigate the effects of garlic in a mouse model. We examined RBC indices and morphologies, spleen histochemistry, RBC half-lives and gene expression profiles, followed up by qPCR and immunoblot validation. The RBCs of garlic-fed mice have shorter half-lives than those of the controls, but the mice have normal blood chemistry and RBC indices. Their spleens manifest increased heme oxygenase 1, higher levels of iron and bilirubin, and presumably higher CO, a pleiotropic gasotransmitter. Heat shock genes and those critical for erythropoiesis are elevated in spleens but not in bone marrow. However, the garlic-fed mice have lower plasma erythropoietin than the controls. Chronic exposure of mice on a garlic-free diet to CO was sufficient to cause increased RBC indices, but again with a lower plasma erythropoietin level than air-treated controls. Furthermore, dietary garlic supplementation and CO treatment showed additive effects in reducing plasma erythropoietin levels in mice. Thus, garlic consumption not only causes increased energy demand from the faster RBC turnover but also increases the production of CO, which in turn stimulates splenic erythropoiesis by an erythropoietin-independent mechanism, thus completing the sequence of feedback regulation for RBC metabolism. Being a pleiotropic gasotransmitter, CO may be a second messenger for garlic's other physiological effects.

    Genome-Wide Association Study of Treatment Refractory Schizophrenia in Han Chinese

    We report the first genome-wide association study of a joint analysis using 795 Han Chinese individuals with treatment-refractory schizophrenia (TRS) and 806 controls. Three loci showing suggestively significant association with TRS were identified. These loci include rs10218843 (P = 3.04×10⁻⁷) and rs11265461 (P = 1.94×10⁻⁷), which are adjacent to signaling lymphocytic activation molecule family member 1 (SLAMF1); rs4699030 (P = 1.94×10⁻⁶) and rs230529 (P = 1.74×10⁻⁷), which are located in the gene nuclear factor of kappa light polypeptide gene enhancer in B-cells 1 (NFKB1); and rs13049286 (P = 3.05×10⁻⁵) and rs3827219 (P = 1.66×10⁻⁵), which fall in receptor-interacting serine/threonine-protein kinase 4 (RIPK4). One isolated single nucleotide polymorphism (SNP), rs739617 (P = 3.87×10⁻⁵), was also found to be associated with TRS. The -94delATTG allele (rs28362691), located in the promoter region of NFKB1, was identified by resequencing and was found to be associated with TRS (P = 4.85×10⁻⁶). The promoter assay demonstrated that the -94delATTG allele had significantly lower promoter activity than the -94insATTG allele in SH-SY5Y cells. This study suggests that rs28362691 in NFKB1 might be involved in the development of TRS.

    Convergent Evidence from Mouse and Human Studies Suggests the Involvement of Zinc Finger Protein 326 Gene in Antidepressant Treatment Response

    OBJECTIVES: The forced swim test (FST) is a commonly used model to predict antidepressant efficacy. Uncovering the genetic basis of the model may unravel the mechanism of antidepressant treatment. METHODS: FVB/NJ (FVB) and C57BL/6J (B6) were first identified as the response and non-response strains to fluoxetine (a serotonin-specific reuptake inhibitor antidepressant) treatment in the mouse FST. Simple-interval (SIM) and composite-interval (CIM) mappings were applied to map the quantitative trait loci (QTLs) of the anti-immobility effect of fluoxetine in FST (FST(FLX)) in 865 male B6×FVB-F2 mice. The brain mRNA expression of the gene with the maximum QTL-linkage signal for FST(FLX) after the FST was compared between B6 and FVB mice and between fluoxetine and saline treatment. The association of the variants in the human homologue of the mouse FST(FLX)-QTL gene with major depressive disorder (MDD) and antidepressant response was investigated in 1080 human subjects (MDD/control = 582/498). RESULTS: One linkage signal for FST(FLX)-QTL was detected at an intronic SNP (rs6215396) of the mouse Zfp326 gene (maximal CIM-LOD = 9.36). The Zfp326 mRNA expression in the FVB thalamus was significantly down-regulated by fluoxetine in the FST, and the higher FVB-to-B6 Zfp326 mRNA expressions in the frontal cortex, striatum and hypothalamus diminished after fluoxetine treatment. Two coding-synonymous SNPs (rs2816881 and rs10922744) in the human homologue of Zfp326, ZNF326, were significantly associated with the 8-week antidepressant treatment response in the MDD patients (Bonferroni-corrected p = 0.004–0.028). CONCLUSIONS: The findings suggest the involvement of the Zfp326 and ZNF326 genes in antidepressant treatment response.